Deep Learning for Genomics: A Concise Overview

نویسندگان

  • Tianwei Yue
  • Haohan Wang
چکیده

Advancements in genomic research such as high-throughput sequencing techniques have driven modern genomic studies into ”big data” disciplines. This data explosion is constantly challenging conventional methods used in genomics. In parallel with the urgent demand for robust algorithms, deep learning has succeeded in a variety of fields such as vision, speech, and text processing. Yet genomics entails unique challenges to deep learning since we are expecting from deep learning a superhuman intelligence that explores beyond our knowledge to interpret the genome. A powerful deep learning model should rely on insightful utilization of task-specific knowledge. In this paper, we briefly discuss the strengths of different deep learning models from a genomic perspective so as to fit each particular task with a proper deep architecture, and remark on practical considerations of developing modern deep learning architectures for genomics. We also provide a concise review of deep learning applications in various aspects of genomic research, as well as pointing out potential opportunities and obstacles for future genomics applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of Deep Neural Networks

In recent years, new neural network models with deep architectures started to get more attention in the field of machine learning. These models contain larger number of layers (therefore ”deep”) than conventional multi-layered perceptron, which usually uses only two or three functional layers of neurons. To overcome the difficulties of training such complex networks, new learning algorithms hav...

متن کامل

Pharmacological properties of some 3-substituted indole derivatives, a concise overview

Indole is a nitrogen-containing heterocycle. It is a very important motif in agriculture and pharmacy. Many compounds containing indole moiety has been isolated form nature. It is also an important part in natural alkaloids. Tryptophan is an amino acid which posses indole. 3-Sustituted indoles are the main group of its derivatives. Because the wide-spread application of 3-substituted indolic co...

متن کامل

Nanoliposome Potentials in Nanotherapy:A Concise Overview

Liposomes have attracted great interest as efficient carriers for nutrients, drugs and other bioactive agents as well as ideal models for biological membranes. This article intends to provide an overview of liposomes and nanoliposomes definition as well as their properties and preparation methods. Also it elaborates on various applications of nanoliposomes in nanotherapy including diagnostics, ...

متن کامل

Modeling positional effects of regulatory sequences with spline transformations increases prediction accuracy of deep neural networks.

Motivation Regulatory sequences are not solely defined by their nucleic acid sequence but also by their relative distances to genomic landmarks such as transcription start site, exon boundaries, or polyadenylation site. Deep learning has become the approach of choice for modeling regulatory sequences because of its strength to learn complex sequence features. However, modeling relative distance...

متن کامل

Learning the Localization Function: Machine Learning Approach to Fingerprinting Localization

Considered as a data-driven approach, Fingerprinting Localization Solutions (FPSs) enjoy huge popularity due to their good performance and minimal environment information requirement. This papers addresses applications of artificial intelligence to solve two problems in Received Signal Strength Indicator (RSSI) based FPS, first the cumbersome training database construction and second the extrap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.00810  شماره 

صفحات  -

تاریخ انتشار 2018